Estimation of Gene Insertion/Deletion Rates with Missing Data.

Authors

  • Utkarsh J Dang
  • Alison M Devault
  • Tatum D Mortimer
  • Caitlin S Pepperell
  • Hendrik N Poinar
  • G Brian Golding
Abstract

Lateral gene transfer is an important mechanism for evolution among bacteria. Here, genome-wide gene insertion and deletion rates are modeled in a maximum-likelihood framework with the additional flexibility of modeling potential missing data. The performance of the models is illustrated using simulations and a data set on gene family phyletic patterns from Gardnerella vaginalis that includes an ancient taxon. A novel application involving pseudogenization/genome reduction magnitudes is also illustrated, using gene family data from Mycobacterium spp. Finally, an R package called indelmiss is available from the Comprehensive R Archive Network at https://cran.r-project.org/package=indelmiss, with support documentation and examples.
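As a rough illustration of the kind of model the abstract describes, the sketch below assumes a two-state continuous-time Markov chain for gene presence/absence, with a deletion rate `mu` and an insertion rate `nu`. It also assumes that a gene truly present in a taxon goes unobserved with probability `delta`. These names and this parameterization are assumptions made for the example. They are not taken from the paper or from the indelmiss package.

```python
import math


def transition_matrix(mu, nu, t):
    """Transition probabilities over a branch of length t.

    States: 0 = gene absent, 1 = gene present.
    mu = deletion rate (1 -> 0), nu = insertion rate (0 -> 1).
    Rows index the starting state, columns the ending state.
    """
    total = mu + nu
    decay = math.exp(-total * t)
    pi_absent = mu / total   # stationary probability of absence
    pi_present = nu / total  # stationary probability of presence
    return [
        [pi_absent + pi_present * decay, pi_present * (1.0 - decay)],
        [pi_absent * (1.0 - decay), pi_present + pi_absent * decay],
    ]


def tip_likelihood(observed, delta):
    """P(observation | true state) for true state 0 and 1 at a tip.

    Assumption: a present gene is missed with probability delta,
    and an absent gene is never recorded as present.
    """
    if observed == 1:
        return [0.0, 1.0 - delta]
    return [1.0, delta]


def branch_likelihood(start_state, observed, mu, nu, t, delta):
    """Likelihood of a tip observation given the state at the branch start."""
    probs = transition_matrix(mu, nu, t)[start_state]
    tip = tip_likelihood(observed, delta)
    return probs[0] * tip[0] + probs[1] * tip[1]
```

In a full analysis, terms like `branch_likelihood` would be combined over every branch of the phylogeny and every gene family, and the rates would be chosen to maximize the resulting likelihood. That step is omitted here.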

Similar resources

Twisted trees and inconsistency of tree estimation when gaps are treated as missing data - The impact of model mis-specification in distance corrections.

Statistically consistent estimation of phylogenetic trees or gene trees is possible if pairwise sequence dissimilarities can be converted to a set of distances that are proportional to the true evolutionary distances. Susko et al. (2004) reported some strikingly broad results about the forms of inconsistency in tree estimation that can arise if corrected distances are not proportional to the tr...

Association of Prolactin and Prolactin Receptor Gene Polymorphisms with Economic Traits in Breeder Hens of Indigenous Chickens of Mazandaran Province

Polymorphisms in the 5’-flanking region of the prolactin (PRL) gene and in exon 2 and exon 5 of the prolactin receptor (PRLR) gene, and their association with growth and egg traits, were examined in breeder hens of the Mazandaran native fowls breeding station. A single nucleotide polymorphism at site C-2402T and a 24 bp nucleotide sequence insertion at position -382 in the 5’-flanking region of the PRL gene were id...

Polymorphisms of prolactin gene in a native chicken population and its association with egg production

The induction and regulation of broodiness is one of the most important roles of prolactin in avian species. The promoter region of the prolactin gene is an appropriate model for studying tissue-specific and hormonally-regulated activation of gene transcription. In this study, the association between prolactin promoter region alleles and egg production in Fars native chickens was investigated. In total...

Standard maximum likelihood analyses of alignments with gaps can be statistically inconsistent

Background: Most statistical methods for phylogenetic estimation in use today treat a gap (generally representing an insertion or deletion, i.e., indel) within the input sequence alignment as missing data. However, the statistical properties of this treatment of indels have not been fully investigated. Results: We prove that maximum likelihood phylogeny estimation, treating indels as missing data, c...

Insertion/deletion polymorphism of angiotensin-converting enzyme and chronic obstructive pulmonary disease: A case-control study on north Indian population

This research aimed to explore the association of the ACE (insertion/deletion) gene polymorphism as a key factor in the development of chronic obstructive pulmonary disease (COPD) in a north Indian population. A total of 200 clinically diagnosed patients with COPD were compared against 200 healthy individuals. Genetic variations of ACE (insertion/deletion) were evaluated by using polymerase chain reaction ...

Journal:
  • Genetics

Volume 204, Issue 2

Pages: -

Publication date: 2016